[[File:Global map of high inequality countries, 2022.png|alt=|thumb|400px|World map of Gini coefficients (as a %), 2022, according to the Poverty and Inequality Platform (PIP) ]]
In economics, the Gini coefficient ( ), also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income inequality, the wealth inequality, or the consumption inequality within a nation or a social group. It was developed by Italian statistics and Sociology Corrado Gini.
The Gini coefficient measures the inequality among the values of a frequency distribution, such as income levels. A Gini coefficient of 0 reflects perfect equality, where all income or wealth values are the same. In contrast, a Gini coefficient of 1 (or 100%) reflects maximal inequality among values, where a single individual has all the income while all others have none.Note: The Gini coefficient could be near one only in a large population where a few persons have all the income. In the special case of just two people, where one has no income, and the other has all the income, the Gini coefficient is 0.5. For five people, where four have no income, and the fifth has all the income, the Gini coefficient is 0.8. See: Bellù, L.G. and Liberati, P. 2006. Inequality Analysis: The Gini Index. EASYPol: Resources for policy making. Rome, FAO.
Corrado Gini proposed the Gini coefficient as a measure of inequality of income or wealth.Gini, Corrado (1936). "On the Measure of Concentration with Special Reference to Income and Statistics", Colorado College Publication, General Series No. 208, 73–79. For OECD countries in the late 20th century, considering the effect of and transfer payments, the income Gini coefficient ranged between 0.24 and 0.49, with Slovakia being the lowest and Mexico the highest. African countries had the highest pre-tax Gini coefficients in 2008–2009, with South Africa having the world's highest, estimated to be 0.63 to 0.7. However, this figure drops to 0.52 after social assistance is taken into account and drops again to 0.47 after taxation. Slovakia has the lowest Gini coefficient, with a Gini coefficient of 0.232. Various sources have estimated the Gini coefficient of the global income in 2005 to be between 0.61 and 0.68.
There are multiple issues in interpreting a Gini coefficient, as the same value may result from many different distribution curves. The demographic structure should be taken into account to mitigate this. Countries with an aging population or those with an increased birth rate experience an increasing pre-tax Gini coefficient even if real income distribution for working adults remains constant. Many scholars have devised over a dozen variants of the Gini coefficient.
He then applied the simple mean difference of observed variables to income and wealth inequality in his work On the measurement of concentration and variability of characters in 1914. Here, he presented the concentration ratio, which further developed into today's Gini coefficient. Secondly, Gini observed that improving methods introduced by Lorenz, Chatelain, or Séailles could also achieve his proposed ratio.
In 1915, Gaetano Pietra introduced a geometrical interpretation between Gini's proposed ratio and between the observed area of concentration and maximum concentration. This altered version of the Gini coefficient became the most commonly used inequality index in upcoming years.
According to data from the OECD, the Gini coefficient was first officially used country-wide in Canada in the 1970s. Canadian index of income inequality ranged from 0.303 to 0.284 from 1976 to the end of the 1980s. The OECD has published more data on countries since the start of the 21st century. The Central European countries of Slovenia, Czech Republic, and Slovakia have had the lowest inequality index of all OECD countries ever since the 2000s. countries also frequently appeared at the top of the equality list in recent decades.
The Gini coefficient is usually defined mathematics based on the Lorenz curve, which plots the proportion of the total income of the population (y-axis) that is cumulatively earned by the bottom x of the population (see diagram). The line at 45 degrees thus represents perfect equality of incomes. The Gini coefficient can then be thought of as the ratio of the area that lies between the line of equality and the Lorenz curve (marked A in the diagram) over the total area under the line of equality (marked A and B in the diagram); i.e., . If there are no negative incomes, it is also equal to 2 A and due to the fact that .
Assuming non-negative income or wealth for all, the Gini coefficient's theoretical range is from 0 (total equality) to 1 (absolute inequality). This measure is often rendered as a percentage, spanning 0 to 100. However, if negative values are factored in, as in cases of debt, the Gini index could exceed 1. Typically, we presuppose a positive mean or total, precluding a Gini coefficient below zero.
An alternative approach is to define the Gini coefficient as half of the relative mean absolute difference, which is equivalent to the definition based on the Lorenz curve. The mean absolute difference is the average absolute difference of all pairs of items of the population, and the relative mean absolute difference is the mean absolute difference divided by the arithmetic mean, , to normalize for scale. If x i is the wealth or income of person i, and there are n persons, then the Gini coefficient G is given by:
When the income (or wealth) distribution is given as a continuous probability density function p( x), the Gini coefficient is again half of the relative mean absolute difference:
where is the mean of the distribution, and the lower limits of integration may be replaced by zero when all incomes are positive.Dorfman, Robert. “A Formula for the Gini Coefficient.” The Review of Economics and Statistics, vol. 61, no. 1, 1979, pp. 146–49. JSTOR, . Accessed 2 Jan. 2023.
A simple case assumes just two levels of income, low and high. If the high income group is a proportion u of the population and earns a proportion f of all income, then the Gini coefficient is . A more graded distribution with these same values u and f will always have a higher Gini coefficient than .
For example, if the wealthiest u = 20% of the population has f = 80% of all income (see Pareto principle), the income Gini coefficient is at least 60%. In another example, if u = 1% of the world's population owns f = 50% of all wealth, the wealth Gini coefficient is at least 49%.
The Gini coefficient can also be considered as half the relative mean absolute difference. For a random sample S with values , the sample Gini coefficient
is a consistent estimator of the population Gini coefficient, but is not in general unbiased. In simplified form:
There does not exist a sample statistic that is always an unbiased estimator of the population Gini coefficient.
and L( x) is the Lorenz function:
then the Lorenz curve L( F) may then be represented as a function parametric in L( x) and F( x) and the value of B can be found by integral:
The Gini coefficient can also be calculated directly from the cumulative distribution function of the distribution F( y). Defining μ as the mean of the distribution, then specifying that F( y) is zero for all negative values, the Gini coefficient is given by:
The Gini coefficient may be expressed in terms of the quantile function Q( F) (inverse of the cumulative distribution function: Q(F(x)) = x)
Since the Gini coefficient is independent of scale, if the distribution function can be expressed in the form f(x,φ,a,b,c...) where φ is a scale factor and a, b, c... are dimensionless parameters, then the Gini coefficient will be a function only of a, b, c.... For example, for the exponential distribution, which is a function of only x and a scale parameter, the Gini coefficient is a constant, equal to 1/2.
For some functional forms, the Gini index can be calculated explicitly. For example, if y follows a log-normal distribution with the standard deviation of logs equal to , then where is the error function ( since , where is the cumulative distribution function of a standard normal distribution).Crow, E. L., & Shimizu, K. (Eds.). (1988). Lognormal distributions: Theory and applications (Vol. 88). New York: M. Dekker, page 11. In the table below, some examples for probability density functions with support on are shown. The Dirac delta distribution represents the case where everyone has the same wealth (or income); it implies no variations between incomes.
0 | ||
Uniform distribution | ||
Exponential distribution | ||
Log-normal distributionFor the log-normal with = 0, = 0; = 0. | ||
Pareto distribution | ||
Chi distribution | ||
Chi-squared distribution | ||
Gamma distribution | ||
Weibull distribution | ||
Beta distribution | ||
Log-logistic distribution | { \left (1+(x/\alpha)^{\beta} \right)^2 } |
is the resulting approximation for G. More accurate results can be obtained using other methods to approximate the area B, such as approximating the Lorenz curve with a quadratic function across pairs of intervals or building an appropriately smooth approximation to the underlying distribution function that matches the known data. If the population mean and boundary values for each interval are also known, these can also often be used to improve the accuracy of the approximation.
The Gini coefficient calculated from a sample is a statistic, and its standard error, or confidence intervals for the population Gini coefficient, should be reported. These can be calculated using bootstrap techniques, mathematically complicated and computationally demanding even in an era of fast computers. Economist Tomson Ogwang made the process more efficient by setting up a "trick regression model" in which respective income variables in the sample are ranked, with the lowest income being allocated rank 1. The model then expresses the rank (dependent variable) as the sum of a constant A and a normal error term whose variance is inversely proportional to y k:
Thus, G can be expressed as a function of the weighted least squares estimate of the constant A and that this can be used to speed up the calculation of the jackknife estimate for the standard error. Economist David Giles argued that the standard error of the estimate of A can be used to derive the estimate of G directly without using a jackknife. This method only requires using ordinary least squares regression after ordering the sample data. The results compare favorably with the estimates from the jackknife with agreement improving with increasing sample size.
However, it has been argued that this depends on the model's assumptions about the error distributions and the independence of error terms. These assumptions are often not valid for real data sets. There is still ongoing debate surrounding this topic.
Guillermina Jasso and Angus Deaton independently proposed the following formula for the Gini coefficient:
where is mean income of the population, Pi is the income rank P of person i, with income X, such that the richest person receives a rank of 1 and the poorest a rank of N. This effectively gives higher weight to poorer people in the income distribution, which allows the Gini to meet the Transfer Principle. Note that the Jasso-Deaton formula rescales the coefficient so that its value is one if all the are zero except one. Note however Allison's reply on the need to divide by N² instead.
FAO explains another version of the formula.
where p j weights the units by their population share, and f( r j) is a function of the deviation of each unit's r j from 1, the point of equality. The insight of this generalized inequality index is that inequality indices differ because they employ different functions of the distance of the inequality ratios (the r j) from 1.
For OECD countries over the 2008–2009 period, the Gini coefficient (pre-taxes and transfers) for a total population ranged between 0.34 and 0.53, with South Korea the lowest and Italy the highest. The Gini coefficient (after-taxes and transfers) for a total population ranged between 0.25 and 0.48, with Denmark the lowest and Mexico the highest. For the United States, the country with the largest population among OECD countries, the pre-tax Gini index was 0.49, and the after-tax Gini index was 0.38 in 2008–2009. The OECD average for total populations in OECD countries was 0.46 for the pre-tax income Gini index and 0.31 for the after-tax income Gini index. Taxes and social spending that were in place in 2008–2009 period in OECD countries significantly lowered effective income inequality, and in general, "European countries—especially Nordic and Continental welfare states—achieve lower levels of income inequality than other countries."
Using the Gini can help quantify differences in Welfare spending and living wage policies and philosophies. However, it should be borne in mind that the Gini coefficient can be misleading when used to make political comparisons between large and small countries or those with different immigration policies (see limitations section).
The Gini coefficient for the entire world has been estimated by various parties to be between 0.61 and 0.68.
The graph shows the values expressed as a percentage in their historical development for a number of countries.
The table below presents the estimated world income Gini coefficients over the last 200 years, as calculated by Milanovic.
+ Income Gini coefficient - World, 1820–2005 |
0.43 |
0.53 |
0.56 |
0.61 |
0.62 |
0.64 |
0.64 |
0.66 |
0.71 |
0.68 |
More detailed data from similar sources plots a continuous decline since 1988. This is attributed to globalization increasing incomes for billions of poor people, mostly in countries like China and India. Developing countries like Brazil have also improved basic services like health care, education, and sanitation; others like Chile and Mexico have enacted more progressive tax policies.
+ Income Gini coefficient - World, 1988–2013 |
0.80 |
0.76 |
0.74 |
0.72 |
0.70 |
0.65 |
Though India's education Gini Index has been falling from 1960 through 1990, most of the population still has not received any education, while 10 percent of the population received more than 40% of the total educational hours in the nation. This means that a large portion of capable children in the country are not receiving the support necessary to allow them to become positive contributors to society. This will lead to a deadweight loss to the national society because there are many people who are underdeveloped and underutilized.
In 2003, Roemer reported Italy and Spain exhibited the largest opportunity inequality Gini index amongst advanced economies.
Shorrocks index is calculated in several different ways, a common approach being from the ratio of income Gini coefficients between short-term and long-term for the same region or country.
A 2010 study using social security income data for the United States since 1937 and Gini-based Shorrock's indices concludes that income mobility in the United States has had a complicated history, primarily due to the mass influx of women into the American labor force after World War II. Income inequality and income mobility trends have been different for men and women workers between 1937 and the 2000s. When men and women are considered together, the Gini coefficient-based Shorrocks index trends imply long-term income inequality has been substantially reduced among all workers, in recent decades for the United States. Other scholars, using just 1990s data or other short periods have come to different conclusions. For example, Sastre and Ayala conclude from their study of income Gini coefficient data between 1993 and 1998 for six developed economies that France had the least income mobility, Italy the highest, and the United States and Germany intermediate levels of income mobility over those five years.
Gini coefficients are simple, and this simplicity can lead to oversights and can confuse the comparison of different populations; for example, while both Bangladesh (per capita income of $1,693) and the Netherlands (per capita income of $42,183) had an income Gini coefficient of 0.31 in 2010, the quality of life, economic opportunity and absolute income in these countries are very different, i.e. countries may have identical Gini coefficients, but differ greatly in wealth. Basic necessities may be available to all in a developed economy, while in an undeveloped economy with the same Gini coefficient, basic necessities may be unavailable to most or unequally available due to lower absolute wealth.
+ Table A. Different income distributions with the same Gini index |
9,000 |
40,000 |
48,000 |
48,000 |
55,000 |
$200,000 |
0.2 |
Even when the total income of a population is the same, in certain situations two countries with different income distributions can have the same Gini index (e.g. cases when income Lorenz Curves cross). Table A illustrates one such situation. Both countries have a Gini coefficient of 0.2, but the average income distributions for household groups are different. As another example, in a population where the lowest 50% of individuals have no income, and the other 50% have equal income, the Gini coefficient is 0.5; whereas for another population where the lowest 75% of people have 25% of income and the top 25% have 75% of the income, the Gini index is also 0.5. Economies with similar incomes and Gini coefficients can have very different income distributions. Bellù and Liberati claim that ranking income inequality between two populations is not always possible based on their Gini indices. Similarly, computational social scientist Fabian Stephany illustrates that income inequality within the population, e.g., in specific socioeconomic groups of same age and education, also remains undetected by conventional Gini indices.
Another limitation of the Gini coefficient is that it is not a proper measure of egalitarianism, as it only measures income dispersion. For example, suppose two equally egalitarian countries pursue different immigration law. In that case, the country accepting a higher proportion of low-income or impoverished migrants will report a higher Gini coefficient and, therefore, may exhibit more income inequality.
+ Table B. Same income distributions, but different Gini Index |
50,000 |
30,000 |
90,000 |
50,000 |
130,000 |
70,000 |
170,000 |
90,000 |
270,000 |
150,000 |
$710,000 |
0.293 |
Deininger and Lyn Squire (1996) show that the income Gini coefficient based on individual income rather than household income is different. For example, for the United States, they found that the individual income-based Gini index was 0.35, while for France, 0.43. According to their individual-focused method, in the 108 countries they studied, South Africa had the world's highest Gini coefficient at 0.62, Malaysia had Asia's highest Gini coefficient at 0.5, Brazil the highest at 0.57 in Latin America and the Caribbean region, and Turkey the highest at 0.5 in OECD countries.
Billionaire Thomas Kwok claimed the income Gini coefficient for Hong Kong has been high (0.434 in 2010), in part because of structural changes in its population. Over recent decades, Hong Kong has witnessed increasing numbers of small households, elderly households, and elderly living alone. The combined income is now split into more households. Many older people live separately from their children in Hong Kong. These social changes have caused substantial changes in household income distribution. The income Gini coefficient, claims Kwok, does not discern these structural changes in its society. Household money income distribution for the United States, summarized in Table C of this section, confirms that this issue is not limited to just Hong Kong. According to the US Census Bureau, between 1979 and 2010, the population of the United States experienced structural changes in overall households; the income for all income brackets increased in inflation-adjusted terms, household income distributions shifted into higher income brackets over time, while the income Gini coefficient increased. Congressional Budget Office: Trends in the Distribution of Household Income Between 1979 and 2007. October 2011. see pp. i–x, with definitions on ii–iii
+ Table C. Household money income distributions and Gini Index, US |
13.7% |
12.0% |
10.9% |
13.9% |
17.7% |
11.4% |
12.1% |
4.5% |
3.9% |
118,682,000 |
0.469 |
While taxes and cash transfers are relatively straightforward to account for, other government benefits can be difficult to value. Benefits such as subsidized housing, medical care, and education are difficult to value objectively, as it depends on the quality and extent of the benefit. In absence of a free market, valuing these income transfers as household income is subjective. The theoretical model of the Gini coefficient is limited to accepting correct or incorrect subjective assumptions.
In subsistence-driven and informal economies, people may have significant income in other forms than money, for example, through subsistence farming or . These forms of income tend to accrue to poor segments of populations in emerging and transitional economy countries such as those in sub-Saharan Africa, Latin America, Asia, and Eastern Europe. Informal economy accounts for over half of global employment and as much as 90 percent of employment in some of the poorer sub-Saharan countries with high official Gini inequality coefficients. Schneider et al., in their 2010 study of 162 countries, report about 31.2%, or about $20 trillion, of world's GDP is informal. In developing countries, the informal economy predominates for all income brackets except the richer, urban upper-income bracket populations. Even in developed economies, 8% (United States) to 27% (Italy) of each nation's GDP is informal. The resulting informal income predominates as a livelihood activity for those in the lowest income brackets.
The Ortego two-parameter model may be superior to the GINI index.
The Gini index is also related to the Pietra index — both of which measure statistical heterogeneity and are derived from the Lorenz curve and the diagonal line.
In certain fields such as ecology, inverse Simpson's index is used to quantify diversity, and this should not be confused with the Simpson index . These indicators are related to Gini. The inverse Simpson index increases with diversity, unlike the Simpson index and Gini coefficient, which decrease with diversity. The Simpson index is in the range 0,, where 0 means maximum and 1 means minimum diversity (or heterogeneity). Since diversity indices typically increase with increasing heterogeneity, the Simpson index is often transformed into inverse Simpson, or using the complement , known as the Gini-Simpson Index.
The Lorenz curve is another method of graphical representation of wealth distribution. It was developed 9 years before the Gini coefficient, which quantifies the extent to which the Lorenz curve deviates from the perfect equality line (with slope of 1). The Hoover index (also known as Robin Hood index) presents the percentage of total population's income that would have to be redistributed to make the Gini coefficient equal to 0 (perfect equality).
The Gini coefficient is sometimes used for the measurement of the discriminatory power of credit rating systems in credit risk management.
A 2005 study accessed US census data to measure home computer ownership and used the Gini coefficient to measure inequalities amongst whites and African Americans. Results indicated that although decreasing overall, home computer ownership inequality was substantially smaller among white households.
A 2016 peer-reviewed study titled Employing the Gini coefficient to measure participation inequality in treatment-focused Digital Health Social Networks illustrated that the Gini coefficient was helpful and accurate in measuring shifts in inequality, however as a standalone metric it failed to incorporate overall network size.
Discriminatory power refers to a credit risk model's ability to differentiate between defaulting and non-defaulting clients. The formula , in the calculation section above, may be used for the final model and at the individual model factor level to quantify the discriminatory power of individual factors. It is related to the accuracy ratio in population assessment models.
The Gini coefficient has also been applied to analyze inequality in dating apps.
extended the concept of the Gini coefficient from economics to reliability theory and proposed a Gini-type coefficient that helps to assess the degree of aging of non-repairable systems or aging and rejuvenation of repairable systems. The coefficient is defined between −1 and 1 and can be used in both empirical and parametric life distributions. It takes negative values for the class of decreasing failure rate distributions and point processes with decreasing failure intensity rate and is positive for the increasing failure rate distributions and point processes with increasing failure intensity rate. The value of zero corresponds to the exponential life distribution or the Homogeneous Poisson Process.
|
|